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Summary 

We  have  continued  to  gain  insights  into  the  central  question  of  interactions 
between  biopolymers  that  are  essential  to  information  processing  and  control  in 
biological  systems.  The  work  largely  focuses  on  interactions  involving  nucleic  acids 
and  proteins  that  have  profound  influences  on  the  expression  of  genetic  information 
and  on  the  behavior  of  all  living  organisms.  Insights  from  studies  of  biological 
systems  have  been  translated  into  synthetic  chemical  approaches  to  the  creation  of 
powerful  and  specific  reagents  for  the  in  vitro  manipulation  of  genetic  material. 

In  addition,  progress  continues  to  be  made  in  elucidation  of  the  detailed 
molecular  structure  of  the  important  DNA  protein  complexes,  largely  by  nmr,  though 
x-ray  crystallography  is  beginning  to  hold  considerable  promise. 

Another  crucial  aspect  of  information  processing  concerns  the  charging  of  a 
particular  tRNA  by  its  cognate  amino  acid,  the  fundamental  control  step  in 
translating  a  nucleic  acid  sequence  into  an  amino  acid  sequence.  The  basic  aspects  of 
this  essential  decoding  process  have  been  another  focus  of  our  work  which  has  been 
successfully  pursued  in  the  past  year. 

JOHN  N.  Abelson 

In  previous  years  we  have  been  studying  the  problem  of  tRNA  identity  by 
changing  the  identity  of  one  tRNA  to  another.  By  changing  an  amber  suppressor 
tRNALeu  to  tRNASer  we  were  able  to  determine  much  about  those  features  of  a 
tRNASer  molecule  which  allow  its  unique  recognition  by  the  serine  tRNA  synthetase. 
We  are  continuing  this  same  approach  with  tRNAsLeu  and  tRNACys.  In  preliminary 
results  it  appears  that  we  have  been  able  to  change  tRNASer  to  tRNALeu  and 
tRNAAla  to  tRNACys. 

We  have  also  taken  a  completely  new  approach.  Drs.  Jeffrey  Sampson  and 
Margaret  Saks  have  made  a  review  of  existing  data  and  ideas  that  have  led  to  the 
deduction  of  putative  identity  sites  for  all  20  E.  coli  synthetases.  The  basic  premises 
of  the  approach  developed  for  deducing  recognition  sites  are  that  (1)  synthetase 
topology  dictates  regions  of  interaction  with  tRNAs  and  (2)  when  taken  individually, 
identity  elements  need  not  be  unique  to  only  one  isoaccepting  group.  Because  bases 
and  base  pairs  do  not  have  to  be  unique  to  be  recognition  elements,  synthetases  may 
interact  with  groups  of  adjacent  nucleotides  and  the  chemical  environments  that  they 
form.  To  test  this  idea,  we  analyzed  acceptor  stem  chemistry  as  a  series  of  trimers. 
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These  analyses  revealed  that  at  least  one  set  of  three  adjacent  nucleotides  formed  a 
unique  trimer  in  13  of  the  20  isoaccepting  groups. 

In  vitro  aminoacylation  assays  are  being  used  to  test  the  deduced  sites  and  the 
assumptions  made  about  the  nature  of  recognition  elements.  The  experiments  focus 
on  tRNA  acceptor  stems  and  the  contribution  of  their  base  sequences  to  recognition. 
These  experiments  use  two  different  types  of  substrates.  Mini  helices  (such  as  those 
used  in  Paul  Schimmel’s  laboratory)  are  being  used  for  studies  of  recognition  of  the 
tRNASer  acceptor  stems  by  seryl  synthetase.  We  have  developed  model  full  length 
tRNAs  for  detailed  studies  of  acceptor  stem  recognition  for  synthetases  that  require 
information  in  both  the  acceptor  stem  and  anticodon.  These  model  substrates  are 
composed  of  a  wild  type  anticodon,  a  standard  body  and  interchangeable  acceptor 
stems. 

Peter  B.  Dervan 
Protein-DNA  Recognition 

The  major  accomplishments  this  past  year  (1990)  are: 

1.  We  completed  our  characterization  of  the  structural  motif  of  the  DNA  binding 
domain  of  the  transcriptional  activator  GCN4  (222-281)  using  the  affinity 
cleaving  method.  In  our  model,  the  positive  dipole  of  the  N-termini  of  the  a- 
helical  leucine  zipper  domains  interact  with  the  phosphate  backbone  of  the 
DNA  as  has  been  suggested  previously.  The  monomers  then  separate  into  two 
arms  which  follow  the  major  groove  of  the  DNA  in  opposite  directions,  placing 
the  N-termini  of  the  DNA  binding  domain  in  successive  major  grooves  on  the 
opposite  face  of  the  DNA  from  the  zipper  region.  {Science  248, 847  (1990)). 

2.  We  are  constructing  hybrid  metalloproteins,  consisting  wholly  of  naturally  oc¬ 
curring  a-amino  acids,  that  oxidatively  cleave  DNA  at  a  single  base  within  13 
base  pair  recognition  sites.  (J.  Am.  Chem.  Soc.  112,  4604  (1990)).  We  have 
found  that  the  tripeptide  GGH  is  a  metal-specific  structural  domain  consisting 
of  naturally  occurring  amino  acids  that  could  be  incorporated  at  the  NH2- 
tenninus  of  a  variety  of  recombinant  proteins  (such  as  other  DNA  binding  pro¬ 
teins,  receptors,  or  antibodies)  with  the  function  of  precise,  efficient  substrate- 
directed  oxidation,  activated  in  the  presence  of  Nidi)  and  peracid  (25“C,  pH  7.5). 

3.  We  continue  our  studies  of  binding  constants  of  DNA  binding  proteins  at  dis¬ 
crete  sites  on  restriction  fragments  using  quantitative  affinity  cleaving. 

4.  In  unpublished  work,  we  have  developed  reagents  for  attachment  of  EDTA  to 
discrete  internal  amino  acid  positions  in  synthetic  proteins.  These  will  be  use¬ 
ful  for  mapping  the  structure  of  protein-DNA  complexes  by  affinity  cleaving. 
(D.  Mack,  P.  B.  Dervan) 

5.  We  have  initiated  studies  of  protein-EDTA  autocleavage  as  a  method  to  deter¬ 
mine  protein  structure  (protein  affinity  cleavage)  in  protein-DNA  complexes. 

DARPA  Supported  Research 


1987  Synthesis  of  a  Sequence  Specific  DNA  Cleaving  Peptide.  J.  Sluka,  M. 
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1990  Structural  Motif  of  the  DNA  Binding  Domain  of  yo  Resolvase  Characterized 
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1990  Characterization  of  ProteinrDNA  Complexes  by  Affinity  Cleaving.  P.  B. 
Dervan,  in  Methods  in  Enzymology,  in  press. 

Leroy  e.  hood 

The  "zinc  finger”  is  a  repeating  protein  domain,  a  structural  motif  that  is 
responsible  for  the  sequence-specific  DNA-binding  activities  of  a  variety  of 
eukaryotic  transcription  factors.  The  zinc  finger  model  (McLachlan  A.  D.  &  Klug,  A. 
(1985)  EMBO  J.  4, 1609-1614;  Brown,  R.  S.,  Sander,  C.  &  Argos,  P.  (1985)  FEBS  Le«. 
186, 271-274)  based  on  analysis  of  Xenopus  transcription  factor  TFIIIA  predicted  that 
conserved  Cys  and  His  residues  act  as  zinc  metal  ion  ligands:  conserved  aromatic  and 
aliphatic  side  chains  were  predicted  to  interact  in  a  hydrophobic  core,  stabilizing  the 
domain. 

The  yeast  transcription  factor  ADRl  contains  two  adjacent  finger  domains.  In 
our  first  approach  to  test  the  structural  and  folding  prediction  of  the  model,  peptides 
encompassing  the  double  finger  (ADRlc)  and  each  of  the  single  fingers  (ADRla  and 
ADRlb)  had  been  chemically  synthesized.  The  folding  and  metal  binding 
characteristics  of  these  synthetic  peptides  had  been  assessed,  as  well  as  the  high 
resolution  three-dimensional  solution  structure  of  both  single  finger  domains  had 
separately  been  determined  employing  2D  NMR  spectroscopy  (COSY,  RELAY, 
TOCSY,  NOESY)  and  the  distance  geometry  program  DSP  ACE  (Hare  Research).  An 
experimentally  determined  model  of  each  of  the  two  single  fingers  is  proposed  that  is 
consistent  with  circular  dichroism,  one  and  two-dimensional  nuclear  magnetic 
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resonance  and  visual  spectroscopy  of  the  single-finger  peptide  reconstituted  in  the 
presence  of  zinc.  Similar  work  with  the  double-finger  peptide  is  in  progress. 

Our  study  revealed  that  in  the  case  of  wild-type  zinc  finger  peptides,  zinc 
binding  is  coincident  with  the  formation  of  a  compact,  globular  domain  containing  an 
amphiphilic  a-helix  and  an  extended  B-like  region.  Whereas  the  two  different  wild- 
type  zinc  fingers  folded  into  one  conformational  species  upon  tetrahedral  binding  of 
zinc,  a  synthetic  deletion  peptide,  deleted  from  a  single  amino  acid  residue  (del.  138- 
ASN)  did  not  bind  zinc  with  tetrahedral  geometry,  nor  did  it  fold  into  a  domain.  This 
result  was  also  found  when  both  wild-type  peptides  underwent  thiol  alkylation  or 
imidazole  participation,  and  spacing  between  pairs  of  ligands  (which  is  conserved) 
are  absolute  requirements  for  tetrahedral  binding  of  metal  and  subsequent  folding 
into  a  domain.  This  provides  direct,  experimental  support  for  the  initial  prediction  of 
the  model. 

Several  point-mutant  single-finger  peptides  of  ADRla  and  ADRlb  have  also 
been  chemically  synthesized.  These  mutants  contain  alteration  of  conserved 
aromatic  and  aliphatic  residues,  which  should  allow  us  to  determine  the  structural 
and  functional  significance  of  the  hydrophobic  core  in  the  folded  zinc-finger  structure. 

Using  genetic  method  several  separate  alanine  point  mutations  have  been 
introduced  in  the  finger  corresponding  to  ADRlb  in  the  protein  and  the  mutant 
proteins’  ability  to  bind  DNA  have  been  screened.  For  the  first  round  of  mutagenesis 
experiments  we  concentrated  on  the  a-helical  region  of  the  finger.  Then  we 
chemically  synthesized  3  selected  alanine  point  mutants  of  the  ADRlb-finger:  in  the 
genetic  experiment  (a)  showed  decreased  DNA-binding,  (b)  showed  increased  DNA- 
binding,  (c)  showed  DNA-binding  that  of  the  wild  type.  The  structural  features  of 
these  and  other  alanine  point  mutants,  hopefully,  will  lead  us  to  an  understanding  of 
how  the  zinc  fingers  are  interacting  with  DNA. 

Publications 

Parraga,  G.,  Horvath,  S.  J.,  Eisen,  A.,  Taylor,  W.  E.,  Hood,  L.,  Young,  E.  T.  and 
Klevit,  R.  E.  (1988)  Zinc-dependent  structure  of  a  single-finger  domain  of  yeast 
ADRl.  Science  214, 1489-1492. 

Parraga,  G.,  Horvath,  S.,  Hood,  L.,  Young,  E.  T.  and  Klevit,  R.  E.  (1990)  Spectroscopic 
studies  of  wild-type  and  mutant  ”zinc  finger”  peptides:  Determinants  of  domain 
folding  and  structure.  Proc.  Natl.  Acad.  Sci.  USA  87, 137-141. 

Klevit,  R.  E.,  Herriott,  J.  R.  and  Horvath,  S.  J.  (1990)  Solution  structure  of  a  zinc 
finger  domain  of  yeast  ADRl.  Proteins:  Structure,  Function  and  Genetics  7,  215-226. 

JOHN  H.  Richards 

Summary 

We  have  made  important  progress  in  our  recent  efforts  to  characterize  the 
three-dimensional  structure  of  the  52  residue  DNA  binding  domain  of  Hin 
recombinase.  The  peptide  can  be  successfully  folded  in  relatively  acidic  buffer  with  a 
small  amount  of  trifluoroethanol  (TFE).  The  unique  folding  conformation  is  now 
being  investigated  by  NMR.  We  have  also  used  circular  dichroism  (CD)  spectroscopy 
to  study  the  secondary  and  tertiary  structure  of  the  peptide  in  various  situations. 
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The  Peptide  Folding  Studied  by  NMR 

Previous  NMR  studies  in  this  laboratory  have  shown  that  the  peptide  in  the 
absence  of  DNA  adopts  an  unstable  conformation  sensitive  to  solution  conditions. 
Detailed  NMR  structure  analysis  of  the  peptide  could  not  be  done  under  physiological 
conditions  (20  mM  phosphate  and  NaCl,  pH  7.6)  because  the  peptide  oligomerizes  at 
the  relatively  high  concentration  necessary  for  NMR  studies.  The  oligomerization 
can  be  prevented  by  adjusting  the  NMR  sample  to  acidic  conditions  (pH  3.4)  in  the 
appropriate  buffers,  yet  the  peptide  still  did  not  adopt  a  unique  conformation  under  a 
variety  of  conditions.  It  was  found  that  a  small  amount  of  TFE  (12%(v)  or  3.4  mol%) 
greatly  stabilizes  the  peptide  conformation  under  the  acidic  conditions  (Fig.  1),  The 
NMR  of  the  peptide  exhibits  a  single  set  of  resonances  with  dispersed  chemical  shifts, 
an  indication  of  folded  and  unique  peptide  conformation.  In  further  NMR  studies,  we 
observed  continuous  stretches  of  peptide  amide  proton-amide  proton  (NH-NH) 
connectivities  (dNN)  in  the  NOESY  of  the  52  mer  peptide  in  water.  These  stretches  of 
NOE  connectivities  are  one  of  the  characteristics  of  a-helical  secondary  structures, 
which  are  part  of  the  helix-turn-helix  model  proposed  for  the  binding  domain.  We  are 
now  in  the  process  of  sequence-specific  resonance  assignments  of  the  52  mer  NMR 
spectrum. 

The  Peptide  Folding  Studied  by  CD 

Under  physiological  conditions,  the  helical  content  of  the  peptide  increases  as 
peptide  concentration  increases.  This  phenomenon  has  been  observed  for  small 
peptides,  and  was  interpreted  as  a  result  of  unstable  peptide  conformation  and 
peptide  intermolecular  interactions  (1).  Under  acidic  conditions,  the  peptide  helical 
content  is  independent  of  its  concentration  in  the  range  of  0.0001  to  0.2  mM, 
indicating  the  absence  of  intermolecular  interactions  (Fig.  2)  at  these  concentrations 
at  pH  3.4. 

A  TFE  titration  (up  to  20%)  of  the  peptide  solution  under  the  acidic  conditions 
observed  by  CD  measurements  showed  that  the  helical  content  increases  as  TFE  is 
added  (Fig.  3),  a  result  which  is  consistent  with  earlier  reports  (2).  Interestingly  the 
CD  spectra  exhibits  a  signal  at  275  nm  which  peaks  at  approximately  12%  TFE  (Fig. 
4).  We  believe  this  signal  is  caused  by  a  lock-out  of  the  tyrosine  rings  in  the  peptide 
in  the  process  of  folding.  Therefore  the  CD  result  coincides  with  the  NMR 
observation  that  the  peptide  folding  occurs  at  an  optimum  TFE  concentration  of  12% 
in  acidic  conditions. 

TFE  has  been  known  to  promote  a-helical  secondary  structures  for  small 
peptides,  but  there  is  so  far  no  report  of  its  stabilizing  tertiary  structures  of  a  peptide. 

Studies  of  DNA-Protein  Binding 

In  the  presence  of  DNA  with  the  specific  base  sequences  characteristic  of  the 
Hin  recombinase  binding  sites,  the  peptide  conformation  is  ^eatly  stabilized,  as 
indicated  by  NMR.  Interestingly,  circular  dichroism  (CD)  examination  of  the  peptide 
does  not  suggest  a  major  change  in  the  composition  of  secondary  structures  of  the 
peptide  upon  binding  to  the  DNA  (under  physiological  conditions.  Fig.  5).  The  a- 
helical  contents  are  about  25%  in  both  situations.  About  the  same  values  are 
obtained  for  the  peptide  in  the  presence  of  12%  TFE  under  acidic  conditions. 

We  propose  that  the  secondary  structural  aspects  of  the  peptide  are  very  similar 
in  both  cases;  folding  into  a  stable  tertiary  structure  depends  more  critically  on  the 
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solution  conditions,  and  very  importantly,  binding  to  DNA  with  the  specific  base 
sequence  of  the  Hix  site  greatly  stabilizes  a  native  tertiary  structure. 
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Melvin  I.  Simon 

We  made  a  great  deal  of  progress  in  the  past  year  primarily  focusing  our 
attention  on  the  specific  interaction  between  the  DNA  binding  peptide  and  its  site.  It 
has  become  clear  to  us  from  a  number  of  experiments  that  in  order  to  completely 
redesign  this  peptide  in  a  useful  fashion  we  will  have  to  have  a  precise  idea  about  its 
three-dimensional  interaction  with  DNA.  In  order  to  do  this  we  have  begun  efforts  to 
get  a  crystal  structure  of  the  protein  DNA  complex  starting  with  the  52  amino  acid 
peptide  that  we  have  synthesized.  As  part  of  this  project.  Dr.  Richards  has  shown 
that  the  peptide  by  itself  in  solution  has  very  little  native  structure,  however,  in  the 
presence  of  the  appropriate  DNA  sequence  the  peptide  is  able  to  assume  NMR 
characteristics  that  suggest  a  distinct  three-dimensional  structure.  Thus,  it  appears 
that  the  correct  DNA  sequence  allows  the  protein  to  fold  up,  to  bind  appropriately 
and  to  assume  a  defined  structure.  This  explains  why  initial  attempts  to  crystallize 
the  52  amino  acid  peptide  by  itself  were  unsuccessful.  However,  we  have  now 
collaborated  with  Dr.  Dickerson  and  Dr.  Reed  Johnson  at  UCLA.  We’ve  prepared 
very  pure  52  amino  acid  peptide  and  different  versions  of  the  DNA  half  site  binding 
sequence  with  one  and  two  base  pair  extensions  on  either  end.  In  initial  experiments 
when  these  were  set  up  to  crystallize  we  were  able  to  obtain  small  crystals  that  are 
still  too  small  to  diffract.  However,  they  appear  to  contain  protein  and  we  are 
currently  working  on  conditions  to  extend  these  studies  and  to  get  crystals  large 
enough  to  obtain  data  regarding  the  three-dimensional  structure  of  the  protein  DNA 
complex.  We  are  also  proceeding  to  work  on  the  three-dimensional  structure  of  the 
entire  enzyme  bound  to  DNA.  Our  collaborator.  Dr.  Doug  Rees  in  the  Chemistry 
Department  here  at  Caltech,  has  just  hired  a  postdoctoral  fellow  who  will  be  working 
together  with  Dr.  Han  Lin  in  our  laboratory  to  purify  large  amounts  of  Hin 
recombinase  for  crystalization  with  the  DNA. 

Characterization  of  Binding  and  Recombination 

In  order  to  fully  characterize  both  in  vivo  and  in  vitro  the  precise  requirements 
for  DNA  protein  binding  in  this  system  a  set  of  40  recombination  sites  which  vary  by 
S3anmetric  pairs  of  single  base  substitutions  from  the  HixC  consensus  recombination 
site  were  constructed.  This  set  was  used  in  the  Salmonella  P22  base  challenge  phage 
selection  to  define  the  DNA  sequence  determinants  for  the  binding  of  Hin  to  DNA  in 
vivo.  Base  pair  substitutions  at  4  symmetric  positions  in  the  HixC  site  were  severely 
defective  in  the  ability  of  Hin  to  bind  to  these  sequences  in  vivo  (see  the  enclosed 
table).  These  include  two  positions  in  the  major  groove  recognition  portion  of  the 
HixC  sequence  and  two  positions  in  the  minor  groove  recognition  portion.  The  base 
substitutions  in  the  major  groove  recognition  which  were  defective  in  Hin  binding 
still  retained  residual  binding  capability  while  the  base  pair  substitution  affecting 
the  minor  groove  recognition  lost  all  binding  capability.  In  vivo  binding  assays  were 
carried  out  with  the  HixC  sites  containing  three  deazaadenine  residues  substituted 
for  adenine  residues  at  positions  5  and  6  of  the  HixC  site.  In  one  construct  the 
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apparent  binding  constant  was  reduced  by  2^fold  when  three  deazaadenine  residues 
were  incorporated  compared  to  the  identical  substrate  containing  adenine  residues. 
These  results  suggest  that  hydrogen  bon_d  contacts  to  bases  in  the  minor  groove  of  the 
Hix  DNA  is  not  a  significant  contributor  of  binding  specificity;  rather  Hin  probably 
recognizes  the  shape  of  the  minor  groove  and  interacts  through  van  der  Waals  and 
phosphate  contacts.  We  will  be  continuing  to  do  studies  on  the  construction  of  DNA 
binding  sites  with  modified  bases  to  define  more  clearly  the  contribution  that  each 
base  and  its  ability  to  form  hydrogen  bonds  with  the  protein  plays  in  the  direction  of 
the  protein  with  DNA. 

We  are  continuing  our  work  on  the  modification  of  the  peptide  and  the 
appropriate  site  to  engineer  new  activities  into  this  particular  polypeptide.  Dave 
Mack  and  Peter  Dervan,  as  part  of  our  group  collaboration  in  the  DARPA  project, 
showed  that  a  tripeptide,  Gly  Gly  His,  when  added  to  the  N-terminus  of  the  Hin  52 
amino  acid  binding  peptide,  was  able  to  bind  nickel  very  tightly.  When  the  peptide 
bound  to  its  appropriate  DNA  site  the  nickel  can  be  activated  by  the  addition  of  an 
oxidizing  agent  and  the  nickel  then  causes  a  break  in  the  DNA  minor  groove. 
Cleavage  by  nickel  is  very  efficient  and  highly  localized  in  the  minor  groove  binding 
region.  We  have  synthesized  the  gene  that  has  the  methionine  codon  followed  by  Gly 
Gly  His  and  the  52  mer  sequence.  We  will  try  to  produce  this  in  E.  coli.  Under  these 
circumstances,  the  methionine  residue  may  be  removed.  As  there  is  independent 
evidence  that  when  a  glycine  follows  methionine  the  methionine  residue  is  efficiently 
removed  from  the  polypeptide  by  endogenous  enzymes  in  E.  coli.  We  will  overproduce 
this  55  amino  acid  fragment  and  assay  for  its  ability  to  bind  nickel,  to  bind  to  Hix 
DNA  and  to  cleave  Hix  DNA. 

Publications  During  this  Period 

Mack,  D.  et  al.  (1990)  Biochemistry  29, 6564. 

Mack  D.  et  al.  (1990)  Biochemistry  29, 655. 

Hughes,  K.  T.,  Lin,  A.  H.,  and  Simon,  M.  I.  Genetic  studies  of  the  requirement  for 
Hin  recombinase  binding.  Studies  on  symmetric  binding  sites.  Manuscript  in 
preparation. 

Glascow,  A.,  Johnson,  R.  and  Simon,  M.  I.  DNA  bending  associated  with  the  Hin 
recombination  enhancer  function.  Manuscript  in  preparation. 


A  (B),  the  peptide  and  the  DNA  mixture  (C),  and  the  difference 
n  secondary  structure  of  the  peptide  upon  binding  is  small.  Samples 


peptide  (H6p2c3  5th  XIO)  5uM.  pH-7.6.  20niM.  1cm:  6-22-90 
DNA  (34)  +  peptide  (H6p2c3  5th) .  5uM  ea.  pH-7.6.  20mM.  6-22 

DNA  (34)  5uM.  pH-7.6.  20mM.  .:  1cm  cell;  6-22-90 
Complex  -  DNA  -  (peptide  -  bkgd) :  difference  CD  spectrum 
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OPERATOR 

hixC  WT 

-1C  +  1G 

-1G  +  1C 

•lT+1  A 

•2A+2T 

-2G+2C 

•2T+2A 

-3A+3T 

•3G+3C 

•3T+3A 

•4C+4G 

-4G+4C 

•4T+4A 

-SC+SG 

•SG+SC 

-5T+SA 

•6C+6G 

•6G+6C 

•6T+6A 

•7C+7G 

-7G+7C 

•7T+7A 

•8C+8G 

•8G+8C 

-8T48A 

•9A<-9T 

•9G+9C 

-9T+9A 

-10A+10T 

•lOC-t-IOG 

•lOG+lOC 

•1  1C  +  11G 

•1 IG4-IIC 

•1  1T  +  1  lA 

-12A+12T 

•12C+t2G 

•12G*12C 

•13A+nT 

•13C<-13G 

-13G+t3C 


SEQUENCE  _ 

TTATCAAAAaoCATGGTTnTGATM 

TH'ATCAAA^ACCCQGGmnGATAA 

■n'AT<>AAAACC!QCGGTTrrTG^TAA 

TTATCAAAAACQL^GGTTTrrGATM 

TTATCAAAMCMIGiGTTTnrGATM 

TTATC^AAAACGATQGTTnTGATAA 

TTATCA^AAACa[ATAGTT^TGA.TAA 

TTATCAAAAA^CATQJXrTTTGATAA 

miX^AAAAQCATGQTITrTGATAA 

mTCAAAA^ICATGA■^T^■GATA^ 

TTATCAAAAQOCiATQQQTTnGATAA 

mTCAAAAQCCATGCCnTrG4JAA 

mTCAAAAICCATGC^TmGATAA 

TCATCA^f^QNXA^02^ST^GA^Afii 

TCATCAAf^QNX^yGGTC^rnG^AA 

TTATCAaAIACCATGGTATrTGATAA 

T^ATCA^CAAa:ATQGTT0T^GATA^ 

TTAT(>VGA‘VXATGGnS2n^^ 

mTCA^I>WXy^TGG^^ATrGATAA 

TTATC^CAAflCCATGGTnfiTGAJAA 

mT(>G4AACCATGGTTlCrG4TAA 

TTATO^AAACCATGGTTTATG^TM 

HATCi^WWXATQGnnGGATAA 

HATC^JAAMlXATGGITnCG^^ 

TTATCIAAAACCATGGTTTTAG^TAA 

TTATAAAAA^CCATGGTTTniATM 

TTATSAAAMCCVVTGGTTTn’fiATM 

HATIAAAMCX^ATGGTTnTMTAA 

Tr/>ACA4AAACCATGGTTnTGITAA 

TTAiXAAWO^TGGTTnTGQTM 

TTAQCAAAA^CX^ATGGTn’rTQCTAA 

TTCTCAAMAOCATGGTTTTTG'Si^ 

TTGTC^AA^ACX^aGGTrmGACAA 

TTXrCAAAAACCATGGTTTrrG^A^ 

TMTCVWVAAa^ATGGTTTTTGMIA 

TCATCAAAAACCATGGTTT^G^TS^ 

TGArCAAAAACCATGGTTTnGMO^ 

ATATaAAAA^CCATGGTTmG4>TAt 

CrrATCAWACCATG3TTrrTG4,TAS 

GTATCAAAAACCATGGrnTTGATAC 
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